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Abstract 

Non-uniformities  in  buffer  delays  and  wire  lengths  introduce  skew 
in  clock  distributif  n  trees.  Previous  techniques  exist  for  elimi¬ 
nating  skew  introduced  by  each  of  these  causes,  not  both.  This 
method  uses  a  pair  of  matched  variable  delay  lines  to  eliminate 
skew  caused  both  by  differing  buffer  delays  and  wire  lengths. 


This  report  describes  research  done  at  the  Artificial  Intelligence  Laboratory 
of  the  Massachusetts  Institute  of  Technology.  Support  for  the  laboratory's 
artificial  intelligence  research  is  provided  in  part  by  the  Advanced  Research 
Projects  Agency  of  the  Department  of  Defense  under  Office  of  Naval  Research 
contract  N00014-92-J-4097. 
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Introduction 


Timing  skew  in  clock  distribution  trees  is  caused  both  by  non-uniformities  in 

buffer  propagation  delay  and  wire  lengths.  Many  commercial  parts  exist  for 

solving  the  problem  of  non-uniform  buffer  delays.  They  work  by  regenerating 

the  clock  signal  at  the  redistribution  point  with  a  phase-locked  loop  (PLL)  I 

so  that  the  amplified  signal  is  in  phase  with  the  reference  signal  received. 

This  approach  ignores  the  delay  introduced  by  the  wire  used  to  deliver  the 
clock  signal  to  its  eventual  destination. 

[Knight  92]|3]  describes  a  method  for  compensating  for  wire  length  delays.  ^ 

While  the  technique  effectively  eliminates  skew  caused  by  wire  delay,  it  ig¬ 
nores  skew  caused  by  the  signal  buffers. 

The  following  technique  eliminates  skew  caused  both  by  non-uniform  buffer 

delays  and  difffering  wire  lengths.  As  in  the  technique  introduced  in  [Knight 

92],  a  pair  of  matched  variable  delay  lines  is  used  to  derive  the  one-way  travel  I 

time  from  a  measured  round-trip  delay.  The  method  can  be  implemented 

most  obviously  using  two  wires  from  the  source  to  the  destination,  but  can 

also  be  modified  to  require  only  one  wire. 


The  Technique 

Many  low-skew  clock  redistribution  buffers  work  as  depicted  in  Figure  1. 
A  reference  clock  signal  is  received  by  the  part.  Instead  of  amplifying  the 
signal,  and  hence  suffering  propagation  delays  that  are  hard  to  control  due  to 
part-to-part  and  temperature  variations,  a  local  voltage-controlled  oscillator 
(VCO)  is  employed  to  generate  a  new  copy  of  the  clock  signal  which  is  then 
amplified.  The  phase  of  the  amplified  copy  is  measured  against  that  of  the 
received  reference,  and  the  VCO  adjusted  to  correct  for  any  errors.  In  essence, 
the  part  functions  as  a  phase-locked  loop  with  the  amplified  signal  tracking 
the  frequency  and  phase  of  the  reference  input. 

The  problem  with  this  scheme  is  that  the  amplified  clock  must  then  be  trans¬ 
mitted  to  its  eventual  receiver  over  wire  with  delay.  This  delay  is  not  com¬ 
pensated  for  in  the  control  loop  of  the  PLL.  This  problem  can  be  eliminated 
if  the  wire  delay  is  included  in  the  signal  that  is  compared  by  the  phase 
detector  with  the  reference. 
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Figure  1;  Phase- locked  loop  redistribution  of  clock  signal.  The  PLL  compares 
the  signal  at  the  edge  of  the  buffer,  not  the  destination. 

A  circuit  as  shown  in  Figure  7  can  be  used.  The  amplified  ic  fed 

into  a  pair  of  matched  variable  delay  lines.  The  signal  is  also  sent  to  the 
receiver,  and  brought  back  along  a  path  with  the  same  electrical  length  as 
the  forward  path.  The  phase  detector  PDl  compares  these  delayed  signals, 
and  the  delay  lines  adjusted  in  tandem  until  the  two  delays  are  the  same, 
namely  2Titne  +  Tpdi  or  2Tdci<iy  +  Tpd2-  Assuming  Tpd-,  can  be  made  close  to 
Tpdi,  Tdeiay  =  and  the  signal  at  point  S  represents  the  same  timing  as 
the  signal  received  at  point  R.  Ph2ise  detector  PD2  compares  llie  signal  at  S 
against  the  reference  and  adjusts  the  phase  of  the  amplified  clock  until  the 
signal  at  S,  and  hence  the  signal  at  R,  is  the  same  in  both  frequency  and 
phase  the  reference. 


Figure  2;  The  two-wire  implementation  of  the  technique.  The  arrival  time 
of  the  amplified  clock  signal  at  the  receiver  is  derived  by  the  matched  delay 
lines  and  sent  to  the  PLL. 
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As  in  [Knight  92],  the  return  path  can  be  eliininated  by  nieeisuring  trails 
mission  line  effects.  If  the  receiver  hits  high  impedance  compared  to  the 
characteristic  impedance  of  the  line,  a  reflected  wave  of  tlie  same  sign  as 
the  outgoing  wave  will  appear  at  the  driver  after  one  round-trip  delay.  Series 
termination  at  the  driver  allows  observation  of  the  reflection  and  prevents  fur¬ 
ther  bounces.  If  the  series  termination  resistance  is  exactly  the  impedance  of 
the  wire,  then  the  voltage  at  the  wire  end  of  the  termination  resistor  doubles 
when  the  reflected  wave  returns.  The  circuit  is  as  shown  in  Figure  3. 
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Figure  3:  Using  transmission  line  reflection  to  measure  round-trip  delay. 
is  adjusted  so  tliat  the  comparator  triggers  on  the  return  bounce  of  the  signal. 


Conclusions 

The  described  technique  compensates  for  non-uniform  buffer  delays  and  wire 
lengths  to  allow  skew-free  redistribution  of  clock  signals.  The  technique 
utilizes  a  pair  of  matched  variable  delay  lines  to  derive  the  wire  delay  and 
incorporates  that  into  the  control  loop  of  a  PLL  clock  generator. 
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